Title of dissertation : TEMPORAL AND SPATIAL ALIGNMENT OF MULTIMEDIA SIGNALS
نویسندگان
چکیده
Title of dissertation: TEMPORAL AND SPATIAL ALIGNMENT OF MULTIMEDIA SIGNALS Hui Su, Doctor of Philosophy, 2014 Dissertation directed by: Professor Min Wu Department of Electrical and Computer Engineering With the increasing availability of cameras and other mobile devices, digital images and videos are becoming ubiquitous. Research efforts have been made to develop technologies that utilize multiple pieces of multimedia information simultaneously. This dissertation focuses on the temporal and spatial alignment of multimedia signals, which is a fundamental problem that needs to be solved to enable such applications dealing with multiple pieces of multimedia data. The first part of the dissertation addresses the synchronization of multimedia signals. We propose a new modality for audio and video synchronization based on the electric network frequency (ENF) signal naturally embedded in multimedia recordings. Synchronization of audio and video is achieved by aligning the ENF signals. The proposed method offers a significant departure to tackling the audio/video synchronization problem from existing work, and a strong potential to address previously untractable scenarios. Estimation of the ENF signal from video is a challenging task. In order to address the problem of insufficient sampling rate of video, we propose to exploit the rolling shutter mechanism commonly adopted in CMOS camera sensors. Several techniques are designed to alleviate the distortions of motions and brightness changes in videos for ENF estimation. We also address several challenges that are unique to the synchronization of digitized analog audio recordings. Speed offset often occurs in digitized analog audio recordings due to the inconsistency in the tape’s rolling speed. We show that the ENF signal captured by the original analog audio recording can be retained in the digitized version. The ENF signal is considered approximately as a single-tone signal and used as a reference to detect and correct speed offsets automatically. A complete multimedia application system often needs to jointly consider both temporal synchronization and spatial alignment. The last part of the dissertation examines the quality assessment of local image features for efficient and robust spatial alignment. We propose a scheme to evaluate the quality of SIFT features in terms of their robustness and discriminability. A quality score is assigned to every SIFT feature based on its contrast value, scale and descriptor, using a quality metric kernel that is obtained in a one-time training phase. Feature selection is performed by retaining features with high quality scores. The proposed approach is also applicable to other local image features, such as the Speeded Up Robust Features (SURF). TEMPORAL AND SPATIAL ALIGNMENT OF MULTIMEDIA SIGNALS
منابع مشابه
Plume outflow intrusion impact on acoustical signal fluctuations in a pre-stratified environment
Existence of outflow intrusion introduces small-scale turbulence that perturbs the vertically stratified character of the sound velocity and causes spatial and temporal fluctuations of the sound propagation. In this experimental study, we have investigated acoustic wave propagation with frequency of 50 kHz in a pre-stratified environment with intrusion of a turbulent plume while the si...
متن کاملDetermination of Spatial-Temporal Correlation Structure of Troposphere Ozone Data in Tehran City
Spatial-temporal modeling of air pollutants, ground-level ozone concentrations in particular, has attracted recent attention because by using spatial-temporal modeling, can analyze, interpolate or predict ozone levels at any location. In this paper we consider daily averages of troposphere ozone over Tehran city. For eliminating the trend of data, a dynamic linear model is used, then some featu...
متن کاملOptimized Seizure Detection Algorithm: A Fast Approach for Onset of Epileptic in EEG Signals Using GT Discriminant Analysis and K-NN Classifier
Background: Epilepsy is a severe disorder of the central nervous system that predisposes the person to recurrent seizures. Fifty million people worldwide suffer from epilepsy; after Alzheimer’s and stroke, it is the third widespread nervous disorder.Objective: In this paper, an algorithm to detect the onset of epileptic seizures based on the analysis of brain electrical signals (EEG) has b...
متن کاملSpatial and Temporal Analysis of Dusty Days in Iran
Iran is one of the world's arid regions and exposed to dust particles in the air. This is important in spatial and temporal terms as well as the affects human health. In this paper a day when due to the amount of dust particles visibility reduces to 5,000 m is considered the dusty day. In the spatial and temporal analysis of dusty days 38 synoptic stations were studied. In order to conduct data...
متن کاملSpatial and Temporal Analysis of Dusty Days in Iran
Iran is one of the world's arid regions and exposed to dust particles in the air. This is important in spatial and temporal terms as well as the affects human health. In this paper a day when due to the amount of dust particles visibility reduces to 5,000 m is considered the dusty day. In the spatial and temporal analysis of dusty days 38 synoptic stations were studied. In order to conduct data...
متن کامل